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8 Related Applications 

9 This application is a continuation-in-part of U.S. Serial 

10 Number 08/193,707 filed February 9, 1994, which in turn is a 

11 divisional of U.S. Serial Number 07/820,364, filed January 14, 

12 1992, now U.S. Patent 5,313,421 issued May 17, 1994. 
13 

14 Technical Fiejfl 

15 This invention pertains to semiconductor memory cells and 



16 arrays, more particularly to electrically erasable programmable 

17 read only memories. 
18 

19 pacfrgrQimd 

20 Erasable programmable read only memories (EPROMs) and 

21 electrically erasable programmable read only (EEPROMs) are well 

22 known in the art. These devices have the ability to store data 
2 3 in non-volatile fashion, while also being capable of being 
24 erased and rewritten as desired. EPROM devices are typically 
2 5 erased by exposing the integrated circuit device to ultraviolet 
2 6 radiation, while EEPROMs allow erasure to be performed 

27 electrically. 

28 One form of EEPROM device includes a so-called "split-gate" 

29 electrode, in which the control gate includes a first portion 

30 overlaying a floating gate and a second portion directly 

31 overlaying the channel. Such a split gate structure is 

32 described in a 5-Volt-0nly Fast-Programmable Flash EEPROM Cell 

33 with a Double Polysilicon Split-Gate Structure by J. Van Houdt 

34 fit al, Eleventh IEEE Non-Volatile Semiconductor Workshop, 
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1 February 1991, in which charge is injected into the floating 

2 gate from the source side of the cell. U.S. Patent 4,652,897 

3 describes an EEPROM device which does not utilize a split-gate, 

4 but which also provides injection to the floating gate from the 

5 source side of the device. 

6 As described in the above referenced U.S. Patent 4,652,897, 

7 memory cells are typically arranged in an array, as is well 

8 known in the art. One form of such an array utilizes buried 

9 diffusions, in which source and array regions are covered with 

10 a fairly thick layer of insulating material. This is shown for 

11 example, in U.S. Patents 4,151,020; 4,151,021; 4,184,207; and 

12 4,271,421. Such buried diffusion devices often utilize a 

13 virtual ground approach, in which columns connecting the sources 

14 of a first column of memory cells also serves to connect drains 

15 of an adjacent column of memory cells. 

16 While many EEPROM devices utilize two layers of 

17 polycrystalline silicon, one for the formation of the floating 

18 gate, and the other for the formation of the control gate and 

19 possibly electrical interconnects, other EEPROM devices utilize 

20 three layers of polycrystalline silicon. For example, U.S. 

21 Patent 4,302,766 provides a first polycrystalline silicon layer 

22 for the floating gate, a second polycrystalline silicon layer 

23 for the control gate, and a third polycrystalline silicon layer 

24 coupled through an erase window to a portion of the first 

25 polycrystalline silicon layer for use during erasure of the 

26 cell. U.S. Patent 4,331,968 also uses a third layer of 

27 polycrystalline silicon to form an erase gate, while U.S. Patent 

28 4,462,090 forms an addressing gate electrode utilizing a third 

29 layer of polycrystalline silicon. U.S. Patent 4,561,004 and 

30 4,803,529 also use three layers of polycrystalline silicon in 

31 their own specific configurations. 

32 Japanese Patent Publication 61-181168 appears to utilize 

33 three layers of polycrystalline silicon to provide additional 

34 capacitive coupling to the floating gate. Japanese Patent 
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Publication 63-2 65391 appears to pertain to a buried diffusion 
array, possibly utilizing virtual grounds. 

European Patent Application 037383 0 describes an EEPROM in 
which two polycrystalline silicon layers are used, with the 
second layer of polycrystalline silicon having two pieces, one 
of which provides the erase function, and one of which provides 
the steering function. 

"A New Flash-Erase EEPROM Cell With a Sidewall Select-Gate 
on its Source Side" by K. Naruke et al. IEDM-89-603 and U.S. 
Patent 4,794,565 describe an EEPROM utilizing a. side wall select 
gate located on the source side of the field effect transistor. 

" EPROM Cell With High Gate Injection Efficiency" by 
M. Kamiya gt al. I EDM 82-741, and U.S. Patent 4,622,656 describe 
an EEPROM device in which a reduced programming voltage is 
provided by having a highly doped channel region under the 
select gate, and the channel region under the floating gate 
being either lightly doped or doped to the opposite conductivity 
type, thereby providing a significant surface potential gap at 
the transition location of the channel . * 

In recent years there has been significant interest in 
producing high capacity FLASH memory devices which use split- 
gate, source-side hot electron programming, i» place of the more 
conventional drain-side channel hot electron (CHE) mechanism. 

The reasons for this include its inherently lower write 
power requirement (l/10th that of CHE or less) , facilitating low 
voltage operation and higher write speeds via increased 
parallelism. In addition, the split gate structure is not 
susceptible to "over erase" related problems (a problem for 
single gate FLASH memories such as ETOX) , and does not 
experience programming difficulty due to strong overerase, which 
can hinder programming after an erasure operation in split-gate 
CHE programming devices. 

In view of these benefits, SunDisk Corporation has patente 
FLASH memory cell and array variants which use source side 
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1 injection integrated with SunDisk's proprietary thick oxide, 

2 poly-to-poly erase tunneling technology, to make a highly 

3 scalable, reliable, low power programming cell (D.C. Guterman, 

4 G. Samachiasa, ¥. Fong and E. Harari, U.S. Patent No. 

5 5,313,421) • 

6 The concept of a multi-bit storage non-volatile cell using 

7 a split gate structure was described by G.S. Alberts and H.N. 

8 Kotecha ( Multi-bit storage FET EAROM cell . IBM Technical 

9 Disclosure Bulletin, Vol. 24 No. 7A, p. 3311, Dec. 1981). They 

10 describe a two-poly, three transistor element-in-series cell, in 

11 which the center transistor's channel is controlled directly by 

12 the poly2 control gate (which also serves as the cell select 

13 gate) , and each of the two end transistor channels are 

14 controlled by corresponding polyl floating gates, which in turn 

15 are capacitively coupled to the control gate, thereby realizing 

16 a plurality of bits in the one physical cell structure. 

17 Recently, at the 1994 I EDM, Bright Microelectronics along 

18 with Hyundai presented a similar dual-bit split-gate cell, 

19 integrated into a contactless, virtual ground array, and using 

20 source side injection programming (Y.Y. Ma and K. Chang, U.S. 

21 Patent No. 5,278,439 - referred to henceforth as the Ma 

22 appxroach) . One structural difference here from the IBM approach 

23 is their separation of the capacitively coupling control gates, 

24 which are formed in poly2, and the select gate, which is formed 

25 in poly3. 

26 In the Ma approach, they use "conventional" negative 

27 control gate driven tunneling through an ultra-thin polyl gate 

28 oxide (about 100A or less) . This erase approach poses some 

29 serious limitations. Erase of one of the two storage 

30 transistors uses floating gate to drain tunneling through the 

31 ultra-thin oxide, accomplished by biasing the drain to 7v and 

32 corresponding control gate to -lOv. Because both of these lines 

33 run perpendicular to the select gate, this forces a block of 

34 cells which are to be simultaneously erased (e.g. a sector) to 
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1 be bit line oriented, as opposed to the more conventional word 

2 line (select gate) oriented, block; i.e. its sector must be 

3 column organized and thus it cannot be row organized. (For 

4 example, a sector could be two columns of floating gates 

5 straddling a bit line/diffusion, including the right hand 

6 floating gates of the left side cells' floating gate pair plus 

7 the left hand floating gates of the right side cells.) Tb 

8 leads to the following disadvantages in the Ma implementation: 
(1) Limited to column sector architecture; i.e. cannot readily 
support the higher read performance row oriented sector 

11 architecture. (Since here, within a sector, both erase anode 

12 and corresponding control gates run perpendicular to row line 

13 direction, this precludes the massively parallel "chunk" 

14 implementation of the row oriented sector, which can 

15 simultaneously access large numbers of cells within that 

16 sector) . 

17 (2) Reguires ultra-thin, approximately 100A, tunneling oxide, 

18 imposing following limitations: 

19 * Scaling limitation associated with pushing the limits 

20 of usable oxide thicknesses, plus the additional area needs 

21 associated with maintaining adeguate coupling reguirements, 

22 which must combat the inherently high capacitance per unit area 

23 of such a thin oxide; 

24 * A myriad of potential retention/reliability problems 

25 inherent to using ultra-thin oxide, combined with the parasitic 

2 6 band-to-band tunneling/hole injection associated with the high 

27 substrate fields adjacent to the diffusion anode; and 

28 * Negative gate bias reguirements on control gate, to 

29 limit band-to-band injection problems, impose process and 

3 0 circuit complexity, plus potentially more layout area 
31 reguirement. 

32 

33 SUMMARY OF THE INVENTION 
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1 In accordance with the teachings of this invention, novel 

2 memory cells are described utilizing source-side injection. 

3 Source-side injection allows programming utilizing very small 

4 programming currents. If desired, in accordance with the 

5 teachings of this invention, to-be- programmed cells along a 

6 column are programmed simultaneously which, due to • the small 

7 programming current required for each cell, does not require an 

8 unacceptably large programming current for any given programming 

9 operation. In one embodiment of this invention, the memory 

10 arrays are organized in sectors with each sector being formed of 

11 a single column or a group of columns having their control gates 

12 connected in common. In one embodiment, a high speed shift 

13 register is used in place of a row decoder in order to serially 

14 shift in the data for the word lines, with all of the data for 

15 each word line of a sector being contained in the shift register 

16 on completion of its serial loading. In one embodiment, 

17 additional speed is achieved by utilizing a parallel loaded 

18 buffer register which receives data in parallel from the high 

19 speed shift register and holds that data during the write 
'20 operation, allowing the shift register to receive serial loaded 

21 data during the write operation for use in a subsequent write 

22 operation. In one embodiment, a verification is performed in 
2 3 parallel on all to-be-programmed cells in a column and the bit 
2 4 line current monitored. If all of the to-be-programmed cells 
25 have been properly programmed, the bit line current will be 

2 6 substantially zero. If bit line current is detected, another 

27 write operation is performed on all cells of the sector, and 

28 another verify operation is performed. This write/ verify 

29 procedure is repeated until verification is successful, as 

3 0 detected by substantially zero bit line current. 

31 Among the objectives of the novel cells constructed in 

3 2 accordance with this invention are avoidance of programming 

33 limitations such as: 
34 
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1 1- High Channel Currents (Power) required for 

2 Programming. 

3 2. High Drain Voltage Requirements, which increase with 
' 4 increased levels of erasure. 

5 3. Loss of Read Performance associated with an increase 

6 in Programming Efficiency via Heavy Channel doping. 

7 4. Program Wearout Associated with Maintaining a High 

8 Drain Bias on Cells exposed to this bias, including 

9 both those cells targeted for programming and those 
10 cells not targeted but still exposed to the voltage. 
11 

12 In an alternative embodiment of this invention, a multi-bit 

13 memory cell is taught utilizing a 3-poly, 3 transistor element - 

14 in-series cell in which the center transistor's channel is 

15 controlled directly by the poly 3 control gate (which serves as 

16 both a cell select gate and erase anode) and each of the two end 

17 transistor channels are controlled by corresponding polyl 

18 floating gates, which in turn are capacitively coupled to the 

19 poly 2 control or steering gates, thereby realizing a plurality 

20 of bits in the one physical cell structure. 

21 T he multi-bit cell contains two bits per unit memory cell, 

22 coming from two floating gate portions, each having their own 

23 control gate (which, in the virtual ground array, runs parallel 

24 to the bits lines) , and sharing one select gate, placed 

25 physically between them (which, in the virtual ground array, 

26 runs perpendicular to the bit lines) . The diffusion BN+ 

27 source/drains straddle the two floating gates, on their opposite 

28 facing channel edges to those adjacent the select gate/transfer 

29 channel. 

0 

30 > Unlike a single floating gate cell, because here the two 

31 floating channels lie in a series configuration, the programmed 

32 threshold voltage level of each floating gate must be limited in 

33 its upper value in order to be readable (similarly to the 

34 Toshiba NAND cell). In this way, either floating gate channel 
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1 can be unconditionally turned on (i.e. independent of its stored 

2 state) when appropriate bias is applied to its corresponding 

3 control gate, when reading the state of the other floating gate. 
4 

5 

6 BRIEF DESCRIPTION OF THE DRAWINGS 

7 Figures la, lb, and lc, are cell layout, cross-sectional 

8 diagram, and equivalent circuit schematic of one embodiment of 

9 this invention; 

10 Figure Id is a plan view of one embodiment of an array 

11 consisting of a plurality of cells of Figures la-lc; 

12 Figure le is a block diagram depicting a memory array 

13 organized by sectors, with appropriate control circuitry; 

14 Figure If depicts the operation of one embodiment of a 

15 memory array organized by sectors as shown in Figure le; 

16 Figure lg is a plan view depicting an alternative array 

17 embodiment utilizing cells depicted in Figures la-lc; 

18 Figure 2a is a cross-sectional view depicting an 

19 alternative embodiment of this invention similar that of Figure 

20 lb; 

21 Figure 2b is a plan view of one embodiment of an array of 

22 memory cells constructed utilizing cells depicted in the cross- 

2 3 sectional view of Figure 2a; 

24 Figure 2c is a diagram depicting the organization and 

25 operating condition of an array such as that of Figure 2b; 

26 Figure 3 is a graph depicting the operation of a memory 

27 cell of Figure lb; 

28 Figure 4 depicts the electrical field distribution along 

29 channels of the device of Figure 5; 

3 0 Figure 5 is a cross-sectional view one embodiment of a 

31 2-poly cell of this invention; 

32 Figure 6 is a cross-sectional view of another embodiment of 
3 3 a 2-poly cell of this invention; 
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1 Figure 7a is a plan view depicting. a portion of a process 

2 sequence utilized in accordance with one embodiment of this 

3 invention; 

4 Figure 7b is a cross-sectional view of the embodiment shown 

5 in the plan view of Figure 7a; . 

6 Figure 8 is a cross-sectional view depicting a fabrication 

7 step suitable for use in accordance with the teachings of this 

8 invention; 

9 Figures 9a and 9b are top and cross-sectional views, 

10 respectively of one embodiment of a multiple-bit memory cell 

11 structure of this invention; 

12 Figure 10a is a schematic diagram of one multi-bit cell of 

13 this invention; 

14 Figure 10b is a circuit diagram depicting one embodiment of 

15 an array of multiple-bit memory cells of this invention, such as 

16 those of Figures 9A and 9B; 

17 Figure 10c is a circuit diagram depicting one embodiment of 

18 an array of cells as shown in Figure 10b plus segment decode 

19 transistor matrix for both bit lines and steering lines; 

20 Figures 11a through lie are detailed top and cross- 

21 sectional views; and 

22 Figures 12a-12f are cross sectional views depicting 

23 fabrication steps suitable for use in fabricating multi-bit 

24 memory cells in accordance with this invention. 
25 

26 DESCRIPTION OF SPECIFIC EM BODIMENTS 

27 The cell layout, cross-sectional diagram and equivalent 

28 circuit schematic of one embodiment are shown in Figures la, lb, 



29 and lc, respectively. Similar reference numerals have been used 

30 in Figures la, lb, and lc. Referring to the cross-sectional 

31 view of Figure lb, this embodiment of the novel EEPROM cell, 

32 101, of this invention includes a buried source region 102 and 

33 a buried drain region 103, each being buried by a relatively 

34 thick layer of dielectric 104 and 105, respectively. Channel 

19619 

062298 9 



( > c) 

1 region 106 is divided into two portions, a first portion 106-1 

2 which is influenced by the third layer polycrystalline silicon 

3 109 and which forms a select gate, and a second portion 106-2 

4 which is influenced by floating gate 107 formed of a first layer 

5 of polycrystalline silicon and which, in turn, is influenced by 

6 control gate 108 formed of a second layer polycrystalline 

7 silicon. As is well known in the art, suitable dielectric 

8 layers such as thermally grown oxide are located between channel 

9 106 and polycrystalline silicon layer 109 and polycrystalline 

10 silicon layer 107. Similarly, suitable dielectric layers such 

11 as oxide or composite oxide/nitride are formed between the three 

12 layers of polycrystalline silicon. Polycrystalline metal 

13 silicide can be used in place of one or more of the 

14 polycrystalline silicon layers 108 and 109. If desired, a 

15 highly-doped P+ region 120 is used within channel 106-2 adjacent 

16 buried drain region 103. This region 120 is formed, for 

17 example, as a double diffused MOS (DMOS) region in order to 

18 establish the threshold voltage V t of the memory transistor 

19 including channel 106-2. This helps to provide a stable 

20 threshold voltage, even though the amount of charges trapped in 

21 the gate oxide layer in the vicinity of the gap between 106-1 

22 and 106-2 tends to increase with a large number of programming 

23 cycles. 

24 An example of operating conditions and levels associated 

25 with the embodiment of Fig. lb are shown in Table 1. High 

26 efficiency programming comes about by the ability to 

27 simultaneously create a high field region in channel 106-2 under 

28 the floating gate, which under the bias conditions of Table 1 

29 occur near the gap between channels 106-1 and 106-2 (see above 
3 0 mentioned I EDM article of Kamiya for theory) while maintaining 

31 a low channel /current. Since this high field causes electron 

32 injection to floating gate 107 near the source side of channel 

33 106-2, this type of operation is termed "source-side" injection. 

34 This mechanism provides high efficiency, low power programming 



19619 

062298 10 



1 by maintaining a low channel current via word line 109 

2 throttling by using a bias operating near channel threshold, 

3 VT p3 « A major attribute of this type of operation is that it 

4 allows for a high drive condition in floating gate channel 106-2 

5 under the floating gate (in fact it thrives on it) , of fering 

6 high-performance read, without degrading programming 

7 performance. This is because the very weak drive condition on 

8 the select transistor of channel 106-1 is established via the 

9 throttling mentioned above to achieve the high fields in the 

10 vicinity of the poly 3/poly 1 gap. These fields accelerate the 

11 electrons to sufficiently energetic levels (i.e. > 3.1 eV) to 

12 surmount the Si/Si0 2 interface barrier at the source side of 

13 floating gate 107. Furthermore, there is a significant vertical 

14 component to that field (i.e. normal to the Si/Si0 2 surface) 

15 driving the electrons up to the surface of channel 106, and 

16 thereby assisting the injection into floating gate 107. No read 

17 performance penalty is incurred to establish this high field 

18 condition. This is in stark contrast to conventional drain side 

19 programming, wherein efficient program requires strong channel 
2 0 saturation which shuns high floating gate channel drives, strong 

21 overerase, or a weakly turned on series select transistor. 

22 These problems with drain side programming dictate high channel 

23 currents, care in overerase, potentially high drain voltages, 
2 4 and unfavorable fields (potentially subducting the channel below 
25 the surface at the drain side and driving electrons downward 

2 6 away from the floating gate) . 

27 Consequently, in accordance with the teachings of this 

28 invention, programming efficiencies (I g /Id) tanging from 

29 10 to 10~3 are possible, with f in the range of 1mA during 

3 0 programming, which is two to three orders of magnitude smaller 
31 than conventional drain side programming. This offers the 
3 2 potential for very fast system level programming by allowing the 
33 programming of 100 times as many memory cells in parallel, 
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1 thereby achieving a 100 fold increase in effective program speed 

2 compared with prior art drain side programming. 
3 



4 T ABLE 1 

5 

6 State Table & Operating Conditions (Fig. lb) 

7 

8 

9 Poly 3 Poly 2 Drain Source 

10 Node ' , (Word (Steering (BN & & (BN 

11 Operation l ine) Gate) Drain) Source) 

12 

13 R STANDBY 0v 0v l.Ov or l.Ov or 

14 E 0v 0v 

15 R L READ SELECTED 5v Ov l.Ov or 0v or 

16 E A Ov l.Ov 

17 AT READ UNSELECTED 5v Ov l.Ov l.Ov 

18 D E 

19 D 
20 

21 

22 • 

23 E R ERASE UNSELECTED 5v Ov Ov Ov 

24 R E 

25 A L 

26 S A 

27 E T ERASE Option 1 5v -10to-17v Ov Ov 

28 E or Option 2 12-22v Ov Ov Ov 

29 ' D 

30 

31 PR PROGRAM H1.5V 14-20V 5-7v Ov 

32 RE SELECTED 

33 O L 

34 G A 

35 R T PROGRAM Ov 14-2 Ov 5-7v Ov 

36 A E UNSELECTED H1.5V 14-20V 5-7v 5-7V 

37 M D OV 14-20V Ov Ov 
38 

39 

40 

41 A major feature of the cell of this invention is the 

42 decoupling of the select function (in this case poly 3 select 

43 transistor 110 in Figure lb) from the steering function (poly 2 

44 control gate 108) . During programming, this allows the 

45 independent control of cell selection/drain current throttling 

46 via poly 3 word line 109 bias (biased at slightly higher than 
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1 VT p3 ) anci stron< ? positive voltage coupling onto floating gate 107 

2 (by raising poly 2 control gate 108 to a high voltage, such as 

3 about 12 volts) . Also, in accordance with the teachings of this 

4 invention, the drain voltage can be adjusted independently of 

5 steering and select transistor voltage levels, to optimize 

6 programming . 

7 During read, the decoupling feature of this invention 

8 provides two important advantages, and one exciting side benefit. 
9 

10 1. The ability to set control gate 108 at the optimum voltage 

11 level for memory state sensing, i.e. the best balanced reference 

12 point for both programmed and erased states. This independence 

13 is in contrast to conventional cells wherein the control gate 

14 also serves as the select transistor, dictating a voltage level 

15 consistent with selection (e.g. Vcc =5v +_ 10%). 
16 

17 2. Improved margin by virtue of being a fixed, (potentially 

18 regulated) reference voltage, eliminating the Vcc variation of +_ 

19 10% inherent to the word line bias levels. (This alone could 

20 improve the floating gate memory window by about 0.6v). 
21 

22 3. A side benefit of the ability to independently set the 

23 control gate voltage bias discussed above, offers the possibility 

24 of a simple way for re-referencing the memory cell for multi- 

25 state (i.e. more than conventional 2-state) encoded data. For 

26 example if the cell is encoded into three level states, (such as 

27 logical 1 = strongly erased/high conducting, logical 2 = 

28 partially programmed/ weakly conducting; logical 3 = strongly 

29 programmed,) then the control gate voltage can be set at two 

30 different levels in a two pass read scheme. For example, in the 

31 first pass read the control gate voltage would be set at about Ov 

32 to discriminate between the logical 1 state and the logical 

33 2/logical 3 states. In the second pass read the control/gate 

34 voltage is set to about 2v, to discriminate between the logical 
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1 3 state and the logical 1/ logical 2 states. By combining the 

2 information of this two pass read (e.g. according to Table 2) the 

3 original state of the 3 state cell is recovered. This biasing 

4 can be done independently of sense amp reference cell 

5 considerations allowing a single sense amp/reference cell circuit 

6 to detect the different states via a multi-pass read scheme. 
7 

8 TABLE 2 
9 

10 READ PASS 1 PASS 2 

11 gTATE [Ref. a ovl [Ref. = 2] 

12 1 Hi Hi 

13 .2 Lo Hi 

14 3 Lo Lo 
15 

16 The two options for erase operation/bias conditions shown in 

17 Table 1 stem from two different sets of considerations. The 

18 first option shown brings poly 2 control gate 108 to a large 

19 negative voltage, but allows poly 3 word line 109 to remain at a 

20 low voltage (e.g. Ov to 5v) . This is desirable since the word 

21 lines and their decoders are preferably high performance, and 

22 repeated many times with a tightly pitched requirement, making 

23 high voltage word line requirements more difficult and real 

24 estate consuming to implement. Poly 2 control or steering gate 

25 108 on the other hand could be common to a multiplicity of word 

26 lines (e.g. a sector consisting of 4 or more word lines), putting 

27 less demands on real estate and minimal impact to performance. 

28 Possible drawbacks of this approach are the process and device 

29 requirements to support both negative as well as positive 
3 0 polarity high voltage circuitry, and reduced steering 

31 effectiveness in that the channel cannot assist in steering by 

32 virtue of it being held at or near ground (i.e. can't go to large 

33 negative potential) . 
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1 Note that poly 2 is used only as a steering electrode during 

2 all three operations. Poly 3, which is the word line connection 

3 to the X~decoder, only sees OV to 5V (other than for erase option 

4 2), and its capacitance can be made relatively small. It is 

5 relatively easy to generate +5V and -17V on poly 2 since both 

6 writing and erasing are slow operations relative to reading and 

7 there is no DC current drain. The -17V does require high voltage 

8 PMOS in the erase decode, but the +5V on poly 3 aids in reducing 

9 the maximum negative voltage required on poly 2 during erase. 

10 The second option of using high word line voltage bias for 

11 erase eliminates both of the above potential drawbacks, but 

12 burdens the high performance, tightly pitched word line/driver 

13 with high voltage requirement. 

14 Figure Id is a plan view of one embodiment of an array 

15 consisting of a plurality of cells constructed as just described 

16 with respect to Figures la-lc, and using similar reference 

17 numerals. Also shown, are channel stop isolation regions 180. 

18 Figure le shows a block diagram of a memory array similar to 

19 that shown in the plan view of Figure Id which is organized by 
2 0 sectors, with appropriate control circuitry. Operation of one 

21 embodiment of such a memory array organized by sectors is shown 

22 in Figure If, where the abbreviations used have the following 

23 meanings: 

24 FLT « float 

25 V BE = bit line erase voltage 

26 Vvc = word line erase voltage 

27 DI = data in 

28 DIV ~ data in during verify operation 

29 V CEU = control gate erase voltage - unselected 

30 V CE = control gate erase voltage - selected 

31 S.A. - sense amplifier 

32 Vcm = control gate margin voltage (during verify operation) 
'33 V CP = control gate program voltage 

34 V CR = control gate read voltage 
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1 V CE = control gate erase voltage 
2 

3 As shown in Figures le and If, in this embodiment sectors are 

4 formed by a single column or a group of columns having their 

5 control gate connected in common. This allows a high speed shift 

6 register to be used in place of a row decoder in order to 

7 serially shift in a whole block of column data for the word 

8 lines, with the data for each word line being contained in the 

9 shift register on completion of its serial loading. The use of 

10 such a high speed shift register saves circuit area on an 

11 integrated circuit by serving both encoding and latching 

12 functions normally performed by a row decoder. Furthermore, 

13 speed is improved by including a parallel loaded buffer register 

14 which receives data in parallel from the high speed shift 

15 register and holds that data during the write operation. While 

16 the write operation takes place based upon the data stored in the 

17 buffer register, the high speed serial shift register receives 

18 the next block of data for subsequent transfer to the buffer 

19 register for the next write operation. In one embodiment of this 

20 invention, each sector has an associated latch for tagging that 

21 sector in preparation for an erase of a plurality of tagged 

22 sectors. 

23 In one embodiment of this invention, a sector is formed in a 

24 group of four cell columns, each column being 1024 bits tall with 

25 a common control gate and an associated sector latch. In this 

26 embodiment, verification of programming is performed in parallel 

27 on all to-be-programmed cells in a single column. Logical 0 

28 state cells have word lines at 0 volts while logical 1 state 

29 cells have word lines at a positive voltage, such as 5 volts. 

30 The control gate and drain voltages are reduced to- a verify level 

31 to allow for proper margin testing and the bit line current is 

32 monitored. If all of the to-be-programmed cells have been 

33 properly programmed, the bit line current will be 0 or 

34 substantially so. If not, it is known that one or more of the 
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1 to-be-programmed cells in the column have not been properly 

2 programmed, and another write operation is performed on the 

3 entire column, thereby assuring that any incompletely ones of the 

4 to-be-written cells are again written. An additional verify step 

5 is performed to verify that the column has been properly 

6 programmed. 

7 One embodiment of a process suitable for fabricating the 

8 structure having the cross-sectional view of Figure lb is now 

9 described. This embodiment can be implemented in a very small 

10 area with no need for an isoplanar oxide when utilizing a virtual 

11 ground, allowing an isolation implant to be placed in the 

12 remaining field which is not covered by diffusions or 

13 polycrystalline silicon and avoids susceptibility to substrate 

14 pitting associated with the SAMOS etch in the field isolation 

15 region not covered by poly 1. This is achieved, for example, 

16 with the following process sequence: 
17 

18 1. Form BN + bit lines in vertical strips. Grow approximately 

19 1500A oxide on top of BN + , and approximately 200-300A gate oxide. 
20 

21 

22 2. As shown in Figs. 7a and 7b, deposit poly 1 to a suitable 

23 conductance and etch in horizontal strips perpendicular to the BN + 

24 diffusion. Fill the spaces between adjacent strips of poly 1 

25 with deposited oxide, such as CVD followed by an etch back. This 

26 approach protects the field isolation regions, and if desired it 

27 can be preceded by a boron channel stop implant. 
28 

29 An alternative for steps 1 and 2 of the above process sequence is 

30 forming horizontal strips of isolation oxide first, and then 

31 depositing P a and etched back in RIE to fill and planar ize the 

32 horizontal grooves between adjacent strips of isolation oxide. 
33 
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1 3. Form thin dielectric 140 such as ONO of approximately 300-400 

2 A. covering poly 1 strips. 
3 

4 4. Deposit poly 2 and form a suitably thick dielectric over layer 

5 (e.g., approximately 2000-3000 A of CVD densified oxide). Etch 

6 this oxide and underlying poly 2 in long vertical strips parallel 

7 to bit line (BN + ) diffusions. 
8 

9 5. Form oxide spacers 62 along edges of poly 2 and use edge of 

10 these spacers to define the floating gate by etching off exposed 

11 poly 1 (i.e. poly 1 not covered by poly 2 or by spacer). 
12 

13 6. Form tunnel erase oxide in a conventional manner, as 

14 described in U.S. patent application serial number 323,779, filed 

15 March 15, 1989, over exposed edges of poly 1 as well as gate 

16 oxide over the channel of the select transistor (channel 106-1 in 

17 Figure lb) . 
18 

19 7. Deposit poly 3 or polysilicide, and form word lines in 

20 horizontal strips. 
21 

22 Another embodiment for achieving a virtual ground cell 

23 without the use of the buried diffusion formed early in the 

24 process is now described. In place of the BN+ of step 1, after 

25 step 6 a photoresist (PR) masked arsenic source/drain implant 

26 103a is used, self -aligned to one edge of poly 2 108 after poly 

27 1 107 stack formation but leaving an unimplanted region along the 

28 other edge to become the poly 3 controlled select transistor 

29 channel (see Figure 8). The isolation oxide thickness formed 

30 earlier between poly 1. strips is made sufficiently thick to 

31 withstand the self -aligned poly 2/1 stack etch without exposing 

32 the substrate to pitting, but thin enough such that following 

33 this stack etch it is readily removed to expose the substrate to 

34 the source drain implant. This offers the benefit of reduced 
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1 thermal drive of the arsenic junction laterally facilitating 

2 scaling. The remainder of the process steps of this embodiment 

3 follows the prior embodiment. 
4 

5 In summary, the novel cell of this invention offers the 

6 following benefits. 
7 

8 * Very low programming current, 

9 * Low programming drain voltage requirement/ eliminating 

10 the need for high voltage. 

11 * Immunity of Programmability to increased levels of 

12 erase. 

13 * Adjustability of memory state for optimum read of both 

14 program and erased states. 

15 * Improved margin by elimination of sensitivity to ± 10% 

16 Vcc variation on the steering element. 

17 * Potential for pure low voltage word line/decoder 

18 implementation . 

19 * Facilitates multi-state cell sensing. 

2 0 * Reduced susceptibility to source side hot-electron 

21 programming induced trapping by establishing a separate 

22 threshold control region at the drain. 
23 

24 A second array embodiment is similar to that of Figure Id but 

25 "uses the cell embodiment shown in Figure lb, to form a row 

26 oriented sector architecture, is shown in Figure lg. A sector 

27 consists of a group of rows, four in this example, which are 

28 erased together. Erase uses option 2 of Table 1, for this row 

29 oriented sector architecture, bringing all the poly 3 word lines 

30 of a sector to high voltage. The poly 2 steering gate is common 

31 to a group of N sectors where N can range from 1 to the full size 

32 of the memory array. Similarly the BN+ columns can alternatively 

33 continuously span the full length of the array or be broken down 

34 into a collection of shorter length, local columns. These 
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1 connect to a global (full array length) column through a select 

2 transistor driven by an additional level of decoding. The local 

3 columns can range from 1 to N sectors. The preferred embodiment 

4 is to have local columns span the same number of sectors as the 

5 poly 2 steering gate. A preferred number of sectors, N, spanned 

6 by local columns and poly 2 steering is around 8. This is 

7 because if N is much smaller than 8, the area overhead for local 

8 column section devices and poly 2 steering gate routing is high 

9 in relation to the area of arrayed cells, while if N is much 

10 larger than 8, the benefits of having localized columns and poly 

11 2 steering diminish. These benefits are: (1) reduced bit line 

12 capacitance improving read performance; (2) reduced repetitive 

13 exposure on unselected sectors to the raised voltage conditions 

14 on drains and steering electrodes when programming one sector 

15 within the N-sector group, and associated potential disturb 

16 phenomena; and (-3) increased confinement of array related 

17 failures thereby increasing the efficiency of replacing such 

18 failures. Read, program and unselected conditions are as 

19 described in Table 1, during read or program. The poly 3 word 

20 line in the selected row within the selected sector is turned on, 

21 5 volts for read and approximately 1 volt for programming. 

22 Concurrently, the drain to source bias conditions are applied to 

23 the columns, approximately 5 volts for program and approximately 

24 1.0-1.5 volts for read. In one embodiment, alternate bits in a 

25 selected row are programmed simultaneously, thereby permitting 

26 all bits in a selected row to be programmed utilizing two 

27 programming operations. In a similar manner, in this alternative 

28 embodiment, alternate bits in a selected row are read (or 

29 verified) simultaneously, thereby permitting all bits in a 

30 selected row to be read (or verified) utilizing two read (or 

31 verify) operations. After one row in the sector has finished 

32 reading or writing, the next row is selected, and so forth to the 
3 3 end of the sector. The resulting row oriented sector 
34 architecture and array operation is much more conventional than 
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1 the column oriented sector of the first embodiment, and 

2 consequently operates in a more traditional manner. Both 

3 embodiments share the intrinsic low power capability of this 

4 invention, but the row oriented sector embodiment requires, in 

5 addition, a full complement of data registers to support 

6 massively parallel write and verify features. 

7 Figure 2a shows an alternative array embodiment of this 

8 invention which does not utilize buried diffusion regions. Thus, 

9 source region 102 and drain region 103 are formed in a 

10 conventional manner and not buried by a thick dielectric layer as 

11 is the case in the embodiment of Figure lb. A plurality of 

12 memory cells are shown in Figure 2a along a cross section of a 

13 typical array structure, with elements of one such cell numbered 

14 using reference numerals corresponding to similar structure in 

15 Figure lb. Table 3 depicts an example of the operating 

16 conditions appropriate for the embodiment of Figure 2a. This a 

17 more traditional cell approach compared to the buried diffusion 

18 cell, with source/drain diffusions formed after all the 

19 polycrystalline silicon structures are formed. It requires one 

20 drain contact to metal bit line for every 2 cells, making it 

21 approximately 30% to 50% larger than the buried diffusion cell 

22 with similar layout rules. In all other respects, this 

23 alternative embodiment offers the same benefits as listed above 

24 for the buried diffusion embodiment of Figure lb. 

25 Figure 2b is a plan view of one embodiment of an array of 

26 memory cells constructed as described above with reference to 

27 Figure 2a. 

28 Figure 2c is an equivalent circuit diagram depicting the 

29 organization of such a memory array in sectors, with appropriate 

30 operating conditions and voltages shown. The preferred 

31 embodiment for a sector organized array uses two word lines which 

32 straddle a source line as part of a sector, along with their 

33 associated poly" 2 steering gates and source line. A full sector 

34 consists of some multiple of such pairing (e.g. 2 such pairs or 
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1 4 word lines, each word line containing 128 bytes and overhead 

2 cells, and straddling two source lines, constitute one sector). 

3 As shown in the embodiment of Figure 2c, the steering lines 

4 are connected together within a sector as are the source lines 

5 (i.e. a sector which consists of row lines grouped together 

6 respectively and driven by common drivers) . The embodiment 

7 described here confines the write operation to the sector being 

8 written to, while the bit line bias conditions (2.5v during read 

9 and approximately 5v possible during write) are non-disturbing to 

10 the cells because the bias is applied to the select transistor 

11 side of the cell and not to the floating gate side. In a two 

12 state cell, to write the cell to a logical one, the bit line is 

13 held at zero volts, causing the cell to program via source-side 

14 injection. Conversely, to inhibit writing, the bit line is held 

15 high (typically about 5 volts), thereby cutting off the channel, 

16 leaving the cell in the erased state. 

17 Sector erase takes place by tagging the selected sector and 

18 raising the associated row lines to a sufficiently high voltage 

19 to erase the floating gates to their required erased levels. 

20 Because of the low programming currents associated with 

21 source side injection (approximately 1-5 microamps/cell) , massive 

22 parallel programming is made practical, e.g. a full row line of 

23 approximately 1000 cells is programmed in a single operation with 

24 total current less than approximately l-5mA, thus providing more 

25 than 100 times more efficiency than prior art drain side 

26 programming arrays. 
27 
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1 TABLE 3 
2 

3 State Table & Operating Conditions (Fig. 2a) 
4 

5 Poly 3 Poly 2 

6 Node (Word (Steering 

7 operation line) Gate) praip Source 

8 

9 R R STANDBY Ov Ov Don't care Ov 

10 EE READ SELECTED 5v Ov 2 . 5v Ov 

11 A L READ UNS ELECTED 5v Ov Don't care Ov 

12 DA 

13 T 

14 E 

15 D 

16 

17 E R STANDBY Ov Ov Ov ! Ov 

18 RE 

19 A L 

20 S A 

21 E T ERASE Option 1 12v-22v Ov Ov Ov 

22 E 

23 D Option 2 5v -10v to -12v Ov Ov .., 

24 ; ' 

25 PR PROGRAM Hl.Ov 14-20 Ov 5v-8v 

26 RE SELECTED 

27 O L 

28 G A 

29 R T PROGRAM OV 14-2 0 Ov 5v-8v 

30 A E DESELECTED Hl.Ov 14-2 0 5v 5V-8V 

31 M D Ov 14-20 5v 5v-8v 
32 

33 

34 Figure 3 is a graph depicting the gate current into poly l 

35 gate 107 of Fig. lb (which is not floating in the Figure 3 test 

36 device to allow this measurement to be made) as a function of 

37 poly 1 gate voltage (V poly x ) while keeping the select transistor 

38 110 V _ at just above its threshold. In this way most of the 

p2 

39 potential drop in channel 106 of Figure 1 occurs in channel 

40 portion 106-1 underneath gate 109 of select transistor 110, and 

41 electrons accelerated in this channel are then injected onto 

42 floating gate 107. From Fig. 3 it is -seen the hot electron 

43 programming injection efficiency of this device is phenomenally 

44 high. 
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1 Various embodiments of a process suitable for fabricating a 

2 structure in accordance with the embodiment of Figures la-Id are 

3 now described. Reference can also be made to copending U.S. 

4 Application Serial No. 323,779 filed March 15, 1989 (now U.S. 

5 Patent 5,070,032), and assigned to SunDisk, the assignee of this 

6 invention. Reference may also be made to fabrication process 

7 steps described earlier in this application. A starting 

8 substrate is used, for example a P type substrate (or a P type 

9 well region within an N type substrate) . A layer of oxide is 

10 formed, followed by a layer of silicon nitride. The layer of 

11 silicon nitride is then patterned in order to expose those areas 

12 in which N+ source and drain regions are to be formed. The N+ 

13 source and drain regions are then formed, for example, by ion 

14 implantation of arsenic to a concentration of approximately 

15 lxl0 20 cm" 3 . The wafer is then oxidized in order to form oxide 

16 layers 104 and 105 in order to cause source and drain regions 102 

17 and 103 to become "buried". Note that for the embodiment of 

18 Figure 2a, this oxidation step is not utilized, as the source and 

19 drain regions are not "buried". Rather, the source and drain 
2 0 regions are formed after all polycrystalline silicon layers are 

21 formed, in a conventional manner. The remaining portion of the 

22 nitride mask is then removed, and the oxide overlying channel 
2 3 regions 106-1 and 106-2 is removed. A new layer of gate oxide 

24 overlying channel regions 106-1 and 106-2 is formed, for example 

25 to a thickness within the range of 150A to 300A and implanted to 

26 the desired threshold (e.g. approximately -lv to +lv) . 

27 Polycrystalline silicon is then formed on the wafer and patterned 

28 in order to form floating gate regions 107. If desired, the 

29 polycrystalline silicon layer is patterned in horizontal strips 

30 (per the orientation of Figure la), with its horizontal extent 

31 patterned at the same time as the patterning of the second layer 

32 of polycrystalline silicon, as will be now described. Following 

33 the formation polycrystalline silicon layer 107 at this time, a 

34 layer of oxide or oxide/nitride dielectric is formed over the 
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remaining portions of polycrystalline silicon layer 107. A 
second layer of polycrystalline silicon 108 is then formed and 
doped to a desired conductivity, for example 30 ohms/square. The 
second layer of polycrystalline silicon is then patterned into 
vertical strips (again, per the orientation of Figure la) . If 
the hori 2 ontal extent of polycrystalline silicon layer 107 was 
not earlier defined, this pattern step is also used to remove the 
layer of dielectric between the first and second layers of 
polycrystalline silicon in those areas where the first layer of 
polycrystalline silicon is to be patterned simultaneously with 
the patterning of the second layer of polycrystalline silicon. 
Following the first layer patterning, an additional layer of 
dielectric is formed on the wafer to form the gate dielectric 
above channel region 106-1, and above any other areas in the 
silicon substrate to which the third layer of polycrystalline 
silicon is to make a gate. These regions can then be implanted 
to the desired threshold voltage (e.g. approximately 0.5v to 
1.5v). The third layer of polycrystalline silicon is for a 
transistor (ranging from 200A to 500A in thickness) then formed 

20 and doped to appropriate conductivity, for example 20 

21 ohms/square. Polycrystalline silicon layer 109 is then patterned 

22 in order to form word line 109. 

23 in one embodiment of this invention, polycrystalline silicon 

24 layer 107 is patterned to form horizontal stripes and channel 

25 stop dopants (e.g. boron) are implanted into the exposed areas 

2 6 therebetween in order to form high threshold channel stop regions 

27 between adjacent rows of a memory array. The thickness of the 

28 gate dielectric between channel 106-2 and polycrystalline silicon 

29 floating gate 107 can range from approximately 150 angstroms or 

30 less to approximately 300 angstroms or more, depending on 

31 performance tradeoffs. For increased drive for reading, a 

32 thinner gate dielectric is desired while for increased coupling 

3 3 between polycrystalline and silicon control gate 108 and floating 
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1 gate 107 (helpful during programming) a thicker gate dielectric 

2 is desired. 
3 

4 ggr.pnd Embodiment 

5 Figure 5 is a two-poly embodiment in which programming occurs 

6 by taking drain 303 high, for example about 10V while raising 

7 control gate 308 just sufficiently so as to turn on select 

8 transistor 310. Since this V CG voltage can vary from one device 

9 to another it is possible to achieve the optimum injection 
conditions by keeping V QG at about 3V while raising source 
(virtual ground) 302 in a sawtooth fashion from , about 0 to 3 

~12 volts and back to 0 again, with a period on the order 

13 approximately 1 microsecond. 

14 This ensures that at some point along the sawtooth the 

15 optimum injection conditions are met. Reference can also be 

16 made to European Patent Application Serial No. 89312799.3 filed 

17 August 12, 1989. To further enhance programming efficiency, in 

18 one embodiment a programming efficiency implant 330 (shown in 

19 dotted line) is introduced at the source side. To read the 

20 device, its source is 0V, drain is approximately l.Ov and V CG 

21 approximately 4.5-5v. To erase we employ poly 1-poly 2 

22 tunneling between floating gate 307 in word line 308 at the 

23 tunneling zone, consisting of one or more of the floating gate 

24 edges, sidewall, corners of the top edge, portions of the top 

25 and portions of the bottom, of floating gate 307, associated 

26 with a tunnel oxide (400A-700A) . Erase takes place with V CG 
approximately 12-22V, V p = 0V, V g = 0V. A capacitive 
decoupling dielectric (approximately 1500 to 2000A thick) 340 
is formed on top of poly 1 to reduce the capacitance between 

30 poly 1 and poly 2. 

31 In one embodiment of this invention, a high electrical 

32 field region is created in the channel far away from the 

33 reverse field region located in conventional devices near the 

34 drain. This is achieved, for example, by utilizing region 330 
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of increased doping concentration at the boundary between 
channels 306-1 and 306-2 under floating gate 307. In one 
embodiment, the width of region 330 is on the order of 0.1 
microns. A larger dimension for region 330 can be 
counterproductive, reducing the select transistor drive with no 

6 gain in efficiency. 

7 Figure 4 depicts the electrical field distribution along 

8 channels 306-1 and 306-2 in structures with and without P + 

9 doped region 330. In a structure without region 330 and 

10 improperly biased select transistor the electron injection can 

11 take place in the high field region near drain 303. Because of 

12 the vertical field reversal region near drain 303, the 

13 resultant injection efficiency is reduced. In a structure with 

14 region 330 the injection takes place in the high field region 

15 located at region 330, far away from the field reversal region. 

16 Because of this, increased injection efficiency is achieved. 

17 From the processing side there are three problems which 

18 must be addressed properly: 

_ The formation of sufficiently thin/high quality gate 

21 dielectric over BN+, which tends to oxidize more quickly than 

22 undoped silicon. 

23 2. The misalignment between poly 1 and the buried N+ drain 

24 diffusion strongly affects the coupling ratios for programming 

25 and erase. This can be overcome at the expense of an increase 

26 in cell area by not using a virtual ground array, but instead 

27 a shared source array. 

28 3. This array permits floating gate 307 to completely overlap 

29 the buried N + diffusion in a dedicated source arrangement, 

30 eliminating this alignment sensitivity. Unfortunately, this 

31 array requires an extra isolation spacing adjacent to the BN+ 

32 to prevent the poly 1 extension beyond BN+ in the direction 

33 away from channel 306-2 to form a transistor in the neighboring 

34 cell. 



19 
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1 To achieve small cell size in the buried diffusion 

2 direction a channel stop isolation is used between adjacent 

3 cells, plus a self -aligned stacked etch to simultaneously 

4 delineate poly 2 and poly 1- This is difficult to do without 

5 pitting the substrate as well as the exposed BN+ when etching 

6 the exposed poly 1 between adjacent cells. This is especially 

7 difficult to avoid when etching the decoupling oxide (1500 - 

8 2000A thick on top of poly 1 in order to expose poly 1, since 

9 the substrate unprotected by poly 1 also becomes exposed, so 

10 that when poly 1 is etched, the substrate in those regions 

11 becomes pitted. 

12 This will therefore require formation of a thick dielectric 

13 region as part of the field isolation process protecting the 

14 substrate in the space between the poly 2 word lines. This can 

15 be accomplished by using a process as described in U.S. Patent 

16 Application : Serial No. 323,779, filed March 15, 1989, and 

17 assigned to SunDisk, the assignee of this application. This is 

18 actually forming trench isolation, but with BN+ abutting this 

19 trench, we may experience severe junction leakage as well as 

20 loss of a portion of the BN+ conductor. This cell of this 

21 second embodiment is attractive because it is double poly, low 

22 programming current, very fast programming, programming away 

23 from drain junction, small and scalable cell. Cell size is 

24 quite attractive as indicated below for three representative 

25 geometries: 

26 1.0m geometries: cell =4. 0x2. 0=8. 0m 2 

27 0.8m geometries: cell = 3.2 x 1.6 = 5.2m 2 

28 0.6m geometries: cell = 2.3 x 1.2 = 2.8m 2 
29 

3 0 Third Embodiment 

31 Figure 6 is a cross-sectional view of alternative 

3 2 embodiment of a two poly cell, using source side injection for 

33 programming, aided by strong coupling to buried N+ drain 403, 

3 4 which acts also as a second control gate. Erase is by Fowler- 
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1 Nordheim tunneling to channel 406 through a small thinned oxide 

2 region, formed for example to a thickness of about 100A, by 

3 utilizing a thin polyspacer. These process steps would be as 

4 follows: Once the drain oxide is formed (i.e. the oxide above 

5 drain 403), a first layer of poly, (approximately 2000A to 

6 4000A thick) is deposited and a thin nitride dielectric is 

7 deposited on top. These layers are then etched using a poly 1 
mask to delineate the lateral extent (as shown in Figure 6) of 

9 the poly 1. A second layer of nitride is then deposited and 

10 anistropically etched back to underlying oxide, leaving the 

11 initial nitride layer on top of poly 1 plus nitride spacers 
along the poly 1 sidewalls. This protects the poly 1 sidewall 
from subseguent oxidation, allowing electrical contact to be 
made as later described. The exposed oxide layer over the 

15 channel portion of the substrate is then stripped and regrown 
to the 100A thickness reguired for tunneling, while a 



8 



12 
13 
14 



16 



BN+ 



17 photoresist masked pattern protects oxide over the exposed, 

18 side of the poly 1 from being stripped. The nitride layers 

19 surrounding poly 1 prevent oxide from forming on that poly. 

20 The thin nitride is then etched off using a highly selective 

21 etch which does not attack or degrade the 100A tunnel oxide 

22 (e.g. hot phosphoric or plasma etch). This is followed by a 

23 second poly deposition which electrically contacts the first 

24 poly on its top surface and its sidewalls. This structure is 

25 then etched using an anisotropic poly-silicon etch, with etch 

26 being terminated with the re-exposure of the oxide layers over 

27 substrate beneath the second deposited poly layer. This 

28 completes the formation of the poly 1 floating gate stripe 

29 shown in Figure 6. The remaining process is similar to that of 

30 the second embodiment. 

31 m this embodiment, programming is from hot channel 

32 electrons injected from grounded source diffusion 402 with 

33 drain 403 held at about +8v and fixed control gate of around 

34 1.5v. Alternatively, programming is performed by hot channel 
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1 electrons from source diffusion 402 utilizing a sawtooth 

2 control gate voltage ranging from 0 volts to a peak voltage 

3 approximately 3 volts, as described previously for the second 

4 embodiment. Read is achieved with V Dg » 1.5V, Vs = 0, V C G = 

5 +5V. Erase is achieved with V CG = -22V, Vs=Vd=OV. In this 

6 embodiment, the poly 2 word line 408 will carry the +5 volts 

7 during read and the -22 volts during erase, thereby requiring 

8 an X-decoder capable of serving this purpose. Coupling 

9 considerations require that °P2P1 > CP1D, which is unfavorable 

10 for programming. Therefore the cell must be optimized for 

11 balancing erase against programming by adjusting oxide 

12 thicknesses and floating gate threshold to the optimum print. 

13 There is less of a problem with pitting the field regions 

14 between cells in the poly 1 direction (because poly 1— poly 2 

15 oxide or ONO is thin) . This may obviate the need for the 

16 additional thick oxide field region described for the second 

17 embodiment. However, there is the additional process 

18 complexity of forming the thin oxide region and extra space 

19 needed to place this thin oxide region sufficiently far from 

20 the source diffusion. 
21 

22 Alternative Operating Methods 

23 A number of alternative methods are possible to program the 

24 source side injection cells described in the previous 

25 embodiments. Strong capacitive coupling (for example, using 

26 thin ONO) is required in the second and third embodiments 

27 between poly 2 and drain, and between poly 2 and poly 1, 

28 respectively, for programming. During operation, one embodiment 

29 applies V D at 5 to 7v, V s = O, the control gate voltage V CQ is 

30 raised to just turn on the control gate channel, and V p2 is on 

31 the order of about 12 volts or more. Alternatively, the source 

32 body effect is used to advantage- In this alternative 

33 embodiment, rather than bringing control gate to a specified 

34 value to just turn on the channel, the control gate is brought 
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1 to a value greater than the voltage required to just turn on 

2 the channel (e.g., approximately one volt above) and a pull- 

3 down circuit is used (e.g., a high impedance resistor or a 

4 current sink) for providing approximately 1 MA current flow via 

5 source debiasing. Alternatively, the control gate voltage V CG 

6 can be operated in a sawtooth fashion from between 0 volts to 

7 about +3 volts, as mentioned previously with respect . to 

8 European patent application serial number 89312799.3. 
9 

10 MnH-i-hit Cells 

H In an alternative embodiment of this invention, such as is 

12 shown in Figures 9a and 9b, a novel structure is taught 

13 including a multi-bit split .gate cell, using source side 

14 injection programming and using poly-to-poly tunneling for 

15 erase. The following describes, in more detail, the operation 

16 of one embodiment of such a structure of this invention. 

17 Basic read operation for such a cell consists of applying 

18 appropriate control gate bias (e.g. 8v - see TABLE 4) to the 

19 unread portion (henceforth for convenience to be termed the 

20 transfer portion) ,. while applying the required read control 

21 gate -bias to the portion being sensed (in multi-state this 

22 would be a bias level appropriate to the state being sensed 

23 for). In one embodiment, the select gate bias is held at 

24 approximately 1.5 volts to keep total cell current limited 

25 (e.g. to about 1 microamp) , independent of the floating gate 

26 conduction level. Alternatively, the select gate bias is 

27 maintained at any desired level, e.g. about 5 volts, depending 

28 on the current sensing requirements. Similarly, to program a 

29 bypass applied on the transfer portion (about 12v) and a 

30 writing potential on the control gate portion (again in multi- 

31 state this would be a bias level appropriate to the state being 

32 written) , with the select gate bias throttled for source side 

33 emission (about 1.5v), and the drain bit line (the bit line 

34 adjacent the to-be-programmed floating gate) raised to about 5v 
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1 for programming, with the source bit line (adjacent to transfer 

2 portion) grounded. 

3 

4 TABLE 4 

5 OPERATING MODES /CONDITIONS 



6 



V? 

7 


CONPITION 




CGL2 


SGI 


CGR2 




8 
9 




• 










10 














11 


R STANDBY 


X 


X 


Ov 


X 


X 


12 


E READ UNSELECTED 


FLOAT 


X 


1.5v 


X 


FLOAT 


13 


A READ FGL12 


Ov 


READ VREF 


1.5v 


8v 


1.5v 


14 


D READ FGR12 


1.5V 


8v 


1.5v 


READ VREF 


Ov 


15 














16 


ERASE 


Ov 


Ov 


VE 


Ov 


Ov 


17 














18 














19 


P 












20 


R STANDBY 


X 


X 


OV 


X 


X 


21 


0 PROG UNSELECTED 


FLOAT 


X 


1.5v 


X 


FLOAT 


22 


G PROG FGL12 


5v 


PROG VREF 


1.5v 


12v 


Ov 


23 


R PROG FGR12 


OV 


12V 


1.5v 


PROG VREF 


5V 



24 A 

25 M 
26 

27 NOTES: X - DON'T CARE; VE - OPTIMUM ERASE VOLTAGE (~<20v) 

28 

29 



30 Following are some key advantages of the multi-bit cell of 

31 this embodiment of this invention: 
32 

33 (1) Approaches (2*lambda) 2 cell size 

34 (2) Highly self-aligned 

3 5 (3) High efficiency source side programming, resulting in 

3 6 lower power and lower voltage requirements, allowing 

37 greater parallelism during write 

38 (4) Attractive for scalability 

3 9 (5) Totally immune to overerase 

41 This cell can achieve (2*lambda) 2 cell size, where lambda 

4 2 is the minimum lithographic feature, because (1) each of its 
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1 lateral component parts, in both its word line and bit line 

2 directions, can be formed using this minimum lambda feature, 

3 and (2) the various critical components are self -aligned to one 

4 another, obviating the need to increase cell size to 

5 accommodate lithographic overlay registration requirements. 

6 ' For example, viewing along the row or word line direction, the 

7 floating gate poly2/l self-aligned stacks and their underlying 

8 channels can be formed using the minimum feature lithographic 

9 width (lambda), while the transfer channels and bit line 

10 diffusions can be simultaneously delineated using the minimum 

11 lithographic space between features (also lambda) , giving it a 

12 (2*lambda) minimum pitch capability along this direction. 

13 Similarly, looking along the poly2 steering gate in the bit 

14 line direction, the channel regions underlying polyl floating 

15 gate and poly3 word line can be formed using the minimum 

16 lithographic feature (lambda), while the isolation region 

17 between word line channels can be formed by the minimum 

18 lithographic space (also lambda), again achieving the minimum 

19 pitch of (2*lambda). In this way, the cell achieves the 

20 (2*lambda) 2 minimum layout area. It is in fact a self -aligned 

21 cross-point cell, the poly2/l stack and corresponding channel 

22 being fully self aligned to the transfer channel and bit line 

23 diffusions, and in the orthogonal direction the isolation being 

24 self-aligned to the channel areas. When combining this with 

25 the low voltage requirement made possible by the source-side 

26 injection programming mechanism, this makes it an ideal element 

27 for still further scaling (i.e. smaller lambda). Finally, its 

28 immunity to overerase comes from the following two factors: 

29 (1) the presence of the series transistor channel select 

30 region, which fully cuts off cell conduction when deselected, 

31 independent of degree of erasure, and (2) the source-side 

32 injection mechanism itself, which is enhanced with strong 

33 overerase, in contrast to the more conventional drain-side 



19619 
062298 



33 



6 o 

1 programming, which becomes retarded by strong levels of 

2 erasure. 

3 in one embodiment, rather than the use of 100A tunneling 

4 oxide for the erase operation as in the prior art Ma approach, 

5 a thick oxide, geometrically enhanced, poly-to-poly tunneling 

6 approach is used, as shown for example in Figures 9a and 9b, 

7 where the poly3 word line serves the dual function of cell 

8 selection and erase anode (one of the architecture/operational 

9 approaches taught in the above-mentioned SunDisk Patent No. 

10 5,313,421). Figures 10a and 10b shows the eguivalent circuit 

11 of this cell/array and TABLE 4 summarizes its operation. 

12 The advantages of this embodiment include: 

13 * Erase unit to follow row line(s), resulting in row oriented 

14 sectoring; 

15 * Avoids need to use negative voltages, erase being 

16 accomplished by holding all electrodes at ground, except for 

17 the selected sector (s) poly3 word lines, which are raised to 

18 erase potential (about or less than 20v) ; 

19 * High reliability inherent to thick oxide tunneling 
20 - implementation; and 

21 * Improved scalability inherent to the use of the thick 

22 interpoly oxide (and consequent reduced parasitic capacitance, 

23 both because of the greater thickness and because of the small 

24 sidewall vicinity limited tunneling area) , combined with the 

25 high degree of vertical integration (vertically stacked poly3 

26 word line serving the dual role of select gate and erase 

27 electrode) . 

28 Such a cell approach offers the potential for a physically 

29 minimal (4*lambda 2 ), highly self aligned, crosspoint cell, 

30 which is both very reliable (use of thick oxides and no high 

31 voltage junction requirements within memory array) , and readily 

32 scalable (via the source side injection element and its reduced 

33 voltage "and more relaxed process control requirements, combined 

34 with the inherent salability of the vertically integrated, 
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thick oxide interpoly erase element). From a physical point of 
view therefore, a Gigabit (or greater) density level embodiment 
based on a 0.25 M technology, has a per bit area of 

4 approximately 0.25ju . 

5 Despite the series nature of the dual gate cell, a four 
level multi-state (two logical bits per floating gate, or four 
logical bits per dual gate cell) can be implemented. The key 
requirement is that the most heavily programmed state plus bias 
level of the transfer floating gate's control gate be optimally 
selected to expose the full multi-state conduction range of the 
memory floating portion, without introducing read disturb. 
Based on the above example, a four-level multi-state 
implementation would give a per bit area approaching 0.1 M 2 

14 (approximately 0.125jt). 

15 in summary, the above described dual-gate cell based on the 
thick oxide, row oriented erase approach offers a novel, non- 
obvious implementation, one that offers significant 
improvements over the prior art in scalability, reliability and 

19 performance. 

21 Alt.rnatrr ^^"^ ffl-nisina Neg ativP Storing Cell 

22 O peration 
23 

24 The control gate (or steering) bias voltage level or range 

25 of levels for reading constitute a powerful parameter in 
setting the memory window voltage position and corresponding 
ranges for the steering element during programming operations 
and the poly3 control/erase element during erase. By allowing 
this level or range of levels to go below Ov, this allows 
shifting up of the floating gate voltage memory window (due to 
its associated charge) by a proportional amount, governed by 
the steering gate coupling ratio. The net result is the 
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33 maximum steering gate voltage level, for both sensing and 

34 programming, is reduced by that negatively shifted amount. 
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Similarly, with the steering gate taken below Ov during erase, 
the maximum erase voltage is also lowered, the amount of which 
is proportional to the steering gate coupling ratio. 

An important parameter in determining steering voltage 
magnitudes is the steering gate coupling ratio, RCG (or R21) - 
C21/CTOT, where C21 is the capacitance between the polyl 
floating gate and the polyZ steering gate, and CTOT is the 
total floating gate capacitance. For example, if the net 
requirement for read plus programming is to capacitively shift 
the floating gate potential by lOv, then given an RCG. of 50%, 
the steering voltage swing must be scaled up by 1/RCG, giving 
a 20v swing. If, on the other hand, RCG is increased to 66.7%, 
the steering voltage swing drops to 15v, a savings of 5v. 
Using this 66.7% value, if the read steering bias voltage level 
(or range) is lowered by 7.5v, the poly3 erase voltage is 
lowered by RCG*7.5v, a savings of 5v over the non-lowered bias 

17 situation. 

18 in order to implement negative steering into an N channel 
based, grounded substrate memory array, one embodiment utilizes 

, P channel circuitry, capable of going negative of ground, to 
21 generate and distribute this bias. In order to support the 
full steering voltage dynamic range, the N well for such 
P channel circuitry is biased to the maximum required positive 
„ voltage, and the P channel circuitry can thus feed any 

25 potential from that value on down to the most negative required 

26 (independent of memory array ground). The positive and 

27 negative voltage limits are provided from either external 

28 supplies or readily generated on chip (for example by N channel 

29 based charge pumps for positive bias and P channel for negative 
bias), since no DC current is required for steering (only 

31 capacitive load charging) . 

32 * m one embodiment, a full column oriented array 

33 segmentation is implemented to form one sector or a group of 

34 row oriented sectors, wherein one sector is read or programmed 
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1 at any given time. All cells in one sector are erased 

2 simultaneously, and one or more sectors can be selected for 

3 simultaneous erasure. Column based segmentation breaks a full 

4 array into a multiplicity of segmented sub-arrays, thereby 

5 eliminating large and/or cumulative parasitics such as 

6 capacitance and leakage. Each sub-array has its own set of 

7 local bit line diffusions and poly2 steering lines, which are 

8 selectively connected by segment select transistor matrixes to 

9 corresponding global bit lines and steering lines. 

10 Figure 10c exemplifies such a segmentation embodiment, 

11 depicting one segment, denoted as SEGMENT I, consisting of N 

12 rows of cells (e.g. N egualling 32). For example, each row 

13 forms one sector consisting of 2048 dual gate cells or 

14 equivilantly 4096 floating gate storage elements. 

15 Alternatively, a sector can be formed by a group of two or more 

16 rows. The long, continuous, global bit lines (typically run in 

17 metal) BLk are selectively connected to the local segment 

18 subcolumns through the Segment Bit Line Transfer Select 

19 transistors 1001, 1002, driven by the SEGi lines. Similarly, 

20 the long, continuous global steering lines (typically run in 

21 metal) Sk are selectively connected to the local segment 

22 steering gates through the Steering Drive Transfer Select 

23 transistors 2001, 2002, driven by the STD_ODDi and STD_EVENi 

24 lines. In this way array segments are isolated from one 

25 another, eliminating the large cumulative parasitics of leakage 

2 6 and capacitance, and providing column associated defect and 

27 repetitive disturb confinement. 

28 Performance can be increased by simultaneously operating on 

29 as many cells in one row as possible (where a row may have 

30 anywhere from IK to 4K floating gate memory transistors) , 

31 thereby maximizing parallelism. Peak power is not a limitation 

32 in such implementation, because of the low cell operating 

3 3 currents inherent to this cell approach both during read and 
34 programming operations. Consequently, the number of floating 
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gate transistors per row which can be simultaneously operated 
on is limited only by addressing constraints and segment decode 
restrictions. For the embodiment shown in Figure 10c, this 
allows every 4th floating gate to be addressed and operated on, 
simultaneously, as outlined in TABLE 5, allowing the full row 
to be addressed and operated on in four passes as follows. 

During each pass, two adjacent diffusions are driven to 
drain potential followed by two adjacent diffusions driven to 
ground, with that bias pattern repeated across the entire row 
of cells, in this way global drain/source bias is applied in 
mirrored fashion to every other of the selected cells, 
Z resulting in floating gate bias conditions of odd selected 
13 cells being reversely applied to those of the even selected 
cells. Appropriate biases are placed on the global steering 
lines, as exemplified in TABLE 5, to satisfy the operation of 
the targeted floating gates as given in TABLE 4, while the 
local steering lines of the unselected cells are discharged and 
18 left isolated from the global steering lines. Once done, the 
bias conditions for both global bit/ground lines and 
targeted/untargeted floating gate steering lines are 
correspondingly interchanged to operate on the other of the 
floating gate pair within the selected cells. Once this is 
completed, similar operation is repeated to the alternate set 
(i.e. previously unselected set) of cells, thereby completing 
25 full row programming in four passes. 
26 
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1 

2 To give an idea of the high speed of this approach with 

3 respect to programming, assuming a physical row of 4096 floating 

4 gate elements, and lOjusec per pass for cell programming, this 

5 gives an effective programming time of ~10nsec/bit or a raw 

6 programming rate of 4096bits per 40/xsec (i.e. per 4*10Msec) or 

7 ~12.5MBytes/sec. 

8 In order to accommodate the negatively shifted steering in 

9 this embodiment, the steering segmentation transistor matrix is 

10 implemented in positively biased N well, P channel based 

11 circuitry. 

12 As indicated above, in order to reduce maximum voltage levels 

13 required, it is desirable to keep the steering gate coupling 

14 ratio relatively high, for example, greater than approximately 

15 60%, (see Figure 10a for one embodiment of a cell equivalent 
circuit). In one embodiment, ONO interpoly2/l dielectric (with, 
for example, an effective tox of 200A) is used, combined with a 
cell structure and process approach (described below), which 



16 
17 



18 



19 reduces the parasitic substrate and interpoly3/l capacitances. 

20 Parasitic capacitances to substrate and drain are, in one 

21 embodiment, kept small by using a narrow channel structure, 

22 bounded by much thicker field oxide regions (such isolation 

23 structure is described in U.S. patent 5,343,063). By way of 

24 example, a cell with a narrow (for example, about 0.1m wide), 

25 approximately 300A thick gate oxide channel region bounded by 

26 about 1500A thick field regions, whose floating gates are laid 

27 out so as to substantially overlap those thick field regions (for 

28 example with a total overlap of, about 0.3*0, would, in 

29 combination with the scaled ONO interpoly2/l, provide steering 

30 capacitance magnitudes of around five times larger than those of 

31 the floating gate to substrate/drain. 

32 In order to reduce the interpoly3/l capacitance, it must 

33 first be noted that in this dual floating gate Flash cell, poly3 

34 crosses two edges of the polyl floating gate, resulting in 
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1 approximately double the interpoly3/l capacitance of cells in 

2 which poly 3 crosses only a single polyl edge (for which parasitic 

3 coupling ratios are typically around 15%) . Although the double 

4 edge structure may offer benefits to the erase tunneling element 

5 (e.g. voltage levels and distributions), its benefit is 

6 outweighed by the higher erasing and programming gate voltages 

7 needed to offset the associated poorer coupling efficiencies. 

. 8 Therefore, it is desirable to eliminate the capacitive impact of 

9 one of these two edges, even if in doing so its erase tunneling 

10 contribution is also eliminated. The following discussion 

11 describes one embodiment of a process to accomplish this, 

12 integrated into the self-aligned diffusion (BN+) formation 

13 process. 

14 To realize a self-aligned BN+ cell, the BN+ sources/drains 

15 must be formed after the poly2/l stack etch (i.e. self -aligned to 

16 poly2) thereby realizing the physically smallest cell. The 

17 challenge here is to remove the field oxide locally over the S/D 

18 region to allow BN+ As implant, while at the same time preserving 

19 sufficiently thick dielectrics surrounding the poly2 steering 

20 line for poly3 to poly2 high voltage isolation. The following 

21 section details the above mentioned exemplary process. 

22 In looking at the twin cell in cross-section (see Figures 

23 lla-lle for top view and various cross-sections, and in 

24 particular Figure lie), the process strategy to achieve both 

25 self-aligned BN+ formation and poly3/l coupling reduction lies in 

26 the ability to separately process the two distinct regions, 

27 namely (1) the vertical strip regions associated with the BN+ and 

28 (2) the vertical strip containing the select channel portions. 

29 In so doing, the poly3/l tunneling edge can be restricted to only 

30 form adjacent to the select strip, while completely eliminating 

31 its formation along the polyl edge bordering the BN+ strip. 

32 This is accomplished in the following manner (refer to Figure 

33 12a for cross-sections in row line direction, following some of 

34 the key process steps. NOTE: the poly3 row lines are defined 



19619 



43 



1 here to run horizontally, and the BN+ columns to run vertically) . 

2 By way of example, the following discussion includes 

3 representative numbers for dimensions and thicknesses, assuming 
4. a 0.25/i technology (printing minimum lithographic feature size, 

5 both width and space, to achieve minimum pitch). 

6 Form field oxide 1100 to a thickness of about 1500A, and etch 

7 it into horizontal strips, adding appropriate channel/ field 

8 implants prior to or at this step. Use an oxide spacer approach 

9 to reduce channel width (for example, reduce from about 0.25m as 

10 etched to about 0.1M post spacer formation, thereby improving 

11 control gate coupling) . Grow floating gate oxide 1101, 

12 (approximately 300A gate oxide). As shown in Sundisk U.S. patent 

13 5,343,063, the fabrication steps up through the forming of polyl 

14 1102 to a thickness of about 1500A are performed. Polyl is then 

15 etched into horizontal strips overlying the channel regions plus 

16 generous overlap on the field region to either side of the 

17 channel. As with channel width, a spacer approach can be used to 

18 decrease the etched polyl spacing, thereby increasing net polyl 

19 overlap of field oxide, or ''wings''. For example, after the 

20 spacer step, polyl spacing is reduced to about 0.1/i, giving polyl 

21 wings of about 0.15 M per side - refer to Figure lib showing a 

22 cross-section through the channel along the column direction for 

23 an example of polyl wings over field oxide. Note that because of 

24 the narrow channel widths vis a vis the polyl thickness, polyl 

25 1102 will completely fill the trench, resulting in a 

26 substantially planar surface. Next form thin 0N0 1103 (for 

27 example, having about 200A tox effective) on top of and along 
edges of the polyl strips. In an alternative embodiment, a 
portion of the top film is formed as part of an initially 

30 deposited polyl layer stack. 

31 Referring to Figures 12a(i) and 12a(ii) , deposit a sandwich 

32 layer of poly2 1104 (about 150oA) , thick poly3/2 isolation oxide 

33 1105 (approximately 2000A) , plus a sufficiently thick" etch 

34 stopping layer 1106 (to block underlying oxide removal when 
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1 exposed to oxide type etch), and top oxide layer 1107. Using a 

2 patterned photoresist masking layer 1108, these are then etched 

3 in strips along the column direction, down to the polyl layer, to 

4 form poly2 the steering gate lines. These exposed polyl regions 

5 are overlying the areas to become select channel and BN+. 

6 Referring to Figures 12b(i) and 12b(ii) , strip previous 

7 photoresist and pattern new photoresist layer 1109 to cover and 

8 protect exposed polyl over the select channel regions. Etch 

9 exposed polyl 1102 and all its underlying oxide 1101 which cover 
the to-be-formed BN+ regions. Oxide layer 1107 over etch 

11 stopping layer 1106 is used to protect etch stopping layer 1106 

12 from being etched by the poly etch as polyl 1102 is being 

13 removed. Etch stopping layer 1106 (e.g. thin undoped polysilicon 

14 or possibly nitride - must have low etch rate compared to oxide 

15 etch rates) is used to prevent that portion of thick poly3/2 

16 isolation oxide 1105 not covered by photoresist 1109 from being 

17 etched down as oxide 1101 beneath polyl 1102 is etched away. The 

18 oxide etch system used is both highly anisotropic (e.g. RIE) and 

19 selective vis a vis the underlying silicon substrate, resulting 

20 in negligible etching of that substrate, accommodating the large 

21 differences in oxide thicknesses being removed between field 

22 oxide (approximately 1500A) and gate oxide regions (approximately 

23 300A) . Following completion of all etching, photoresist 1109 is 
2 4 removed . 

Referring to Figures 12c(i) and I2c(ii), at this point, an 
option is to implant and drive a sufficient Boron dose to form a 

27 p+ DMOS type doping profile adjacent to the BN+ junction 

28 (alternatively, this is the point at which the arsenic BN+ is 

29 implanted, but the resulting lateral diffusion makes the floating 

30 gate channel unnecessarily short). As shown in Figs. 12c(i) and 

31 12c(ii), oxide is formed and reactive ion etched back down to 

32 silicon to form sidewall spacers 1110 (about 750& thick, with the 
^3 thickness here being determined by interpoly3/2 erase high 
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1 voltage isolation requirements, for example, about 25v) . The 

2 arsenic BN+ strips are then implanted. 

3 Referring to Figures I2d(i) and 12d(ii) , a new patterned 

4 photoresist layer 1111 is added to cover and protect BN+ strips. 

5 The exposed polyl over channel strips is etched, to expose the 

6 selected channel regions. (Since some of the polyl overlies 

7 channel regions and is therefore thicker, while other portions 

8 overlie field oxide and is thinner, the same considerations for 

9 oxide etch selectivity apply as above, in the oxide over BN+ 

10 etching case.) 

11 Referring to Figures 12e(i) and 12e(ii), once photoresist 

12 1111 is stripped, oxide is formed (e.g. via thermal oxidation or 

13 some composite oxide) to simultaneously form the polyl sidewall 

14 1112 and corner interpoly3/l tunneling oxides 1113 (for example, 

15 about 350&) , the poly3 gate oxide 1114 over the select channel 

16 and oxide 1115 over BN+ (e.g. less than about 30oA - the 

17 requirement for both of these oxides being they must be 

18 sufficiently thick to reliably hold up to the erase voltage to 

19 substrate differential) . A select transistor threshold adjust 

20 implant can be optionally introduced at this time (e.g. 

21 increasing channel dopant concentration to raise select Vt, or 

22 introducing compensation implant to reduce select Vt) . 

23 Referring to Figures 12f(i) and 12f(ii), after deposition and 

24 patterning of poly3 (which in one embodiment is polysilicide in 

25 order to reduce word line delay) the basic dual gate cell 

26 structure is complete. In one embodiment of this invention, a 

27 high electrical field region is enhanced in the channel far away 

28 from the reverse field region located in conventional devices 

29 near the drain and source regions. This is achieved, for 

30 example, by utilizing regions 1200 of increased doping 

31 concentration at the boundary between the channels 1201 and 1202 

32 and transfer channel region 1203. In one embodiment, the width 
3 3 of region 1200 is on the order of 0.1 microns. 
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1 Using the above dimension and film thickness example values, 

2 the total floating gate capacitance becomes about 0.4 

3 femptoFarads, and coupling ratios are approximately: Steering 
Gate (Ml) 70%; Erase Gate 20%; - Floating gate to Substrate & 
Drain 10%. Although this R21 value may vary from this figure 

o somewhat in that fringing fields from the other terminals are not 

7 accounted for, this approximation indicates, adequate coupling 

8 ratios are achieved in the dual gate cell, even under aggressive 

9 cell scaling. 

10 A process variant of the above approach, which can reduce 

11 further still the erase coupling, is to completely fill the 

12 region over BN+ with an oxide, after BN + formation. Thxs is 

13 done, for example, by depositing a sufficiently thick, 

14 undensified (and hence easily etched away compared to underlying 

15 densified oxide films) oxide layer, patterning photoresist strips 

16 over the BN+ to protect it from etching, and etching away the 

17 exposed, undensified film over the select channel strips. 

18 Following this step and resist removal, the poly3/l tunnel oxide 

19 process proceeds" as outlined above, during which time the oxide 
2 0 filler over BN+ is densified. 

21 The above approach and its variant outlines one of a number 

22 of possible ways to implement the above described dual floating 

23 gate cell into the desired array. 

24 in summary, several concepts have been introduced to 

25 implementing the TWIN FG cell. 

26 Fundamental to the cell is its low power source side 

27 programming mechanism, and low power row oriented poly-to-poly 

28 erase element. Additionally, its independent steering and 

29 selection functions, facilitates low power, multi-state read and 
3 0 programming operations. 

31 ONO interpol Y 2/l is readily integrated to provide a high 

32 capacitive coupling, ultra-low leakage steering element. One 
3b embodiment uses a full column segment confinement architecture to 
3 4 substantially reduce parasitic bit line capacitance and leakage. 
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1 A negatively shifted voltage steering implementation allows 

2 reduction of maximum voltage ceilings for both the poly2 steering 

3 lines during programming and the poly3 word/erase lines during 

4 erase. Under such implementation, one preferred embodiment for 

5 the column segmented array architecture is via an N-well isolated 

6 P channel steering selection matrix. 

7 High steering ratio is achieved by the narrow channel plus 

8 field oxide approach to allow formation of wings. A preferred 

9 embodiment is described which reduces the interpoly3/l parasitic 

10 as part of a self -aligned BN+ formation process. This replaces 1 

11 the thinner tunneling oxide adjacent one of the two potential 

12 tunneling edges with a much thicker isolation oxide. Based on 

13 the example used, this approach can give a cell with steering 

14 coupling ratio approaching 70%, and parasitic erase coupling down 

15 to 2 0%. Furthermore, based on that example, which uses a 0.25/; 

16 technology for the 4*lambda 2 dual floating gate, poly3 word/erase 

17 line cell (where lambda is the minimum technology feature size) , 

18 a physical cell area of 0.25/x 2 is realizable, which for 8 (16) 

19 level of multi-state translates to an effective cell size 

20 approaching ~0.08/x 2 (~0.06/i 2 ) per logical bit. These small sizes, 

21 around 100 times smaller than physical sizes of cells used in the 

22 4 MEG and 8 MEG generation of Flash memories,, are suitable for 

23 building Gigabit density level Flash memories with comparable die 

24 sizes and at comparable cost per die. 

25 All publications and patent applications mentioned in this 

26 specification are herein incorporated by reference to the same 

27 extent as if each individual publication or patent application 

28 was specifically and individually indicated to be incorporated by 

29 reference. 

30 The invention now being fully described, it will be apparent 

31 to one of ordinary skill in the art that many changes and 

32 modifications can be made thereto without departing from the 

33 spirit or scope of the appended claims. 
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